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FACTORIZATIONS OF THE FIBONACCI INFINITE WORD 


GABRIELE FICI 


Abstract. The aim of this note is to survey the factorizations of the Fibonacci infinite word that 
make use of the Fibonacci words and other related words, and to show that all these factorizations 
can be easily derived in sequence starting from elementary properties of the Fibonacci numbers. 
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1. Preliminaries 

The well-known sequence of Fibonacci numbers (sequence A000045 in the On-Line Encyclopedia 
of Integer Sequences) is defined by Fi = 1, F 2 = 1 and for every n > 2, F^ = F^-i -f Fn- 2 - The 
first few values of the sequence Fn are reported in Table for reference. 

A basic property of Fibonacci numbers (that can be easily proved by induction) is that 1 plus 
the sum of the first n Fibonacci numbers is equal to the (n -I- 2)-th Fibonacci number: 


( 1 ) 1 + Fj — Fn+2- 

i=l 

We recall here a famous result, usually attributed to Zeckendorf [13], but published earlier by 
Lekkerkerker |5] and which, in fact, is a special case of an older and more general result due to 
Ostrowski m- It permits us to use Fibonacci numbers as a basis for representing integers: 

Theorem 1. Every positive integer can be expressed uniquely as the sum of one or more distinct 
non-consecutive Fibonacci numbers Fn, n > 1. 

For example, 17 = 13-1-3-1-1 = Fj -\- F 4 -\- F 2 , and there is no other way to write 17 as 
the sum of non-consecutive Fibonacci numbers (assuming the convention that Fi is not used in 
the representation). Thus, one can represent natural numbers as strings of 0-1 bits, where the 
i-th bit (from the right) encodes the presence/absence of the {i -\- l)-th Fibonacci number in the 
representation given by Theorem So for example the number 17 is represented by 100101. We 
call this representation of natural numbers the Zeckendorf representation. 

The first few natural numbers and their Zeckendorf representations are displayed in Table 
where we padded to the left with Os in order to have strings of the same length. Note that with 
6 bits one can represent the first 21 natural numbers. In fact, for every n > 0, there are exactly 
Fn integers whose leftmost 1 in the Zeckendorf representation is in position n (starting from the 
right). From Q, we derive that one needs n bits to represent the first Fn +2 natural numbers. 

The strings of length n forming the Zeckendorf representations of the first Fn +2 natural numbers 
are precisely all the 0-1 strings of length n not containing two consecutive Is. These strings are in 
lexicographic order if the natural numbers are in increasing order from 0 to Fn +2 ~ 1- 

Let us define f{n), for every n > 0, as the rightmost digit of the Zeckendorf representation of 
n. For every n > 1 we define the n-th Fibonacci word as the string 

/n=/(0)/(l)---/(F„-l) 

of length \fn\ = Fn- By convention, we set /i = 1. The first few Fibonacci words are shown in 

Tables 
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We also define the Fibonacci infinite word f as the limit of /„ as n goes to infinity. That is, / 
is the infinite word whose n-th letter is the “parity” of the Zeckendorf representation of n: 

f = /(0)/(l)/(2)/(3) • • • = 0100101001001010010 • • • 

In the Zeckendorf representation of an integer, when the n-th digit from the right is a 1, the 
(n — l)-th digit from the right is a 0. Hence, the rightmost n — 2 digits of the Zeckendorf represen¬ 
tations of the natural numbers from F„^i to T '„+2 ~ 1 are the same rightmost n — 2 digits of the 
Zeckendorf representations of the first Fn natural numbers. For example, the 2 rightmost digits of 
the Zeckendorf representations of 5, 6 and 7 are, respectively, 00, 01, 10, as well as the 2 rightmost 
digits of the Zeckendorf representations of the 0, 1 and 2. We deduce that for every n > 2, one has 

(2) fn = fn-lfn-2- 

For more details on Fibonacci words the reader can see, for instance, [T]. 

Recall that a factorization of an infinite word ic is a sequence (xn)n>i of finite words such that 
w can be expressed as the concatenation of the elements of the sequence, i.e., w = T\n>l^n- 

In general, exhibiting a factorization (xn)n>i of an infinite word w can be useful to better 
understand the combinatorics of w, provided the sequence (x„)„>i has non-trivial combinatorial 
properties—for example, all the words in the sequence are palindromes, squares, or prefixes of w. 

Another point of view consists in defining a factorization by some general rule that can be 
applied to any infinite word. The sequence (Xn)n>i is therefore determined by the particular 
instance of the infinite word w (as is the case, for example, in the Lempel-Ziv factorization or in 
the Lyndon factorization, that we will see below). In this case, the word w can have particular 
properties that make it a limit example for that particular factorization. 

In next sections, we will show a number of factorizations of the Fibonacci infinite word that 
make use of the Fibonacci finite words and other related words. These factorizations have been 
introduced over the time in different papers, and we think it can be useful to collect them all 
together for reference. We also add some (at least to the best of our knowledge) novel factoriza¬ 
tions. Moreover, we present these factorizations in an order that allows us to provide a short and 
elementary proof for each of them, despite the original proofs being sometimes more involved or 
more technical. 


2. Fibonacci words and co-Fibonacci words 

The first factorization of the Fibonacci infinite word we exhibit is the following. 

Proposition 1. The Fibonacci infinite word can be obtained by concatenating 0 and the Fibonacci 
words: 

( 3 ) / = 0 []/„ 

n>l 

= O-l-O-Ol-OlO-OlOOl-01001010••• 

Proof. Since for every z > 1, |/i| = F(, it is sufficient to prove that, for every n > 1, /„ occurs in 
/ starting at position 1 -|- Yl'iZi Pi = Pn+i- From ([^, we have /„+2 = fn+ifn, so that /„ has an 
occurrence in / starting at position |/n-i-i| = Fn+i- □ 

Let us consider the sequence of the palindromic prefixes of /, also called central words. The 
first few values of the sequence are displayed in Tablewhere e denotes the empty word, i.e., 
the word of length 0. 

As it is well-known, for every n > 3, is obtained from /„ by removing the last two letters. 
More precisely, we have for every n > 1, 

(4) /2n+l = P2n+l01, /2n+2 = P2 ti+2 10. 

The fundamental property of the central words is the following: 
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Lemma 1. For every n > 2 one has 

P 2 n +1 = P2n-l01p2n = P2n^0p2n+l, P 2 n +2 = P 2 nWp 2 n+l = P2n+l01p2n- 

Proof. Follows immediately from ([^ and Q. □ 

Remark 1. It is easy to see from Q that for every n> A, one has fn = fn- 2 fn- 3 fn- 2 - We have 
therefore from @; 

(5) / = 01001 HUfn-lfn 

n >2 

= 01001 • (0 • 1 • 0)(01 • 0 • 01)(010 • 01 • 010)(01001 • 010 • 01001) • • • 

Analogously, since 1 = fs, we can write 

(6) / = OlOOUfn-lfnfu-l 

n >2 

= 0100 • (1 • 0 • 1)(0 • 01 • 0)(01 • 010 • 01)(010 • 01001 • 010) • • • 

We now introduce a class of words that we call the co-Fibonacci words. Although this class has 
appeared previously in the literature [5] , to the best of our knowledge no one has yet given a name 
to them. 

The co-Fibonacci words fl^ are defined by complementing the last two letters in the Fibonacci 
words fn, that is, f^ = PnVX, where x and y are the letters such that /„ = PnXy. Equivalently, 
co-Fibonacci words can be defined by fn = fn- 2 fn-i for every n > 3. The first few co-Fibonacci 
words are displayed in Table 

The following lemma is a direct consequence of Lemma 

Lemma 2. For every n > 2 one has 

f 2 n-\-l — /2n/2n-l5 f 2 n +2 = f 2 nf 2 n+l- 

Proposition 2. The Fibonacci word can be obtained by concatenating 0 and the odd co-Fibonacci 
words: 


( 7 ) 


/ — 0 f2n+l 


n>l 


= 0 - 10 - 01010-0100101001010 • 
Proof. Follows directly from @ replacing f 2 n-if 2 n with f 2 n+i- 
Analogously, we have the following: 


□ 


Proposition 3. The Fibonacci word can be obtained by concatenating 01 and the even co-Fibonacci 
words: 

( 8 ) 


/ = 0in/2n+2 


n>l 


= 01 - 001 - 01001001 - 010010100100101001010 - 
Proof. Follows directly from @ replacing / 2 „/ 2 n-si with f 2 n+ 2 - 

3. Singular words 


□ 


Let us define the left rotation of a non-empty word w = wiW 2 • ■ ■ Wn, Wi letters, as the word 
= WnWi - - - Wn-i- Analogously, the right rotation of w is defined as the word = W 2 ■ ■ ■ WnWi. 
The singular words fn are defined by complementing the first letter in the left rotations of the 
Fibonacci words fn- The first few singular words are displayed in Table Note that for every 
n > 1, one has f 2 n+i = 0 p 2 n-i-i 0 and / 2„+2 = lp 2 n-i- 2 l - 

The singular words are palindromic factors of / but do not appear as prefixes of / (by the way, 
/ also contains other palindromic factors besides the central words and the singular words /„, 
e.g., 1001 or 01010, see [5] for more details). Their name comes from the fact that among the 
Fn-\- 1 factors of / of length Fn, there are Fn of them that can be obtained one from each other 
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by iteratively applying left (or equivalently right) rotation and one, the singular word, whose left 
(or equivalently right) rotation is not a factor of /. 

Wen and Wen m proved that the Fibonacci infinite word can be obtained by concatenating 
the singular words: 

Proposition 4. The Fibonacci infinite word is the concatenation of the singular words: 

(9) f = Ilf - 

n>l 

= 0 • 1 • 00 • 101 • 00100 • 10100101 • • • 

Proof. Indeed, © follows directly from ([^ and the definition of singular words, observing that 
the Fibonacci words end by letter 0 and 1 alternatingly. □ 


The factorization ([^ is in fact the Lempel-Ziv factorization of /. The Lempel-Ziv factorization 
is a factorization widely used in computer science for compressing strings M- The Lempel-Ziv 
factorization of a word w is w = wiW 2 • ■ ■ where wi is the first letter of w and for every i>2,Wi is 
the shortest prefix of WiWi+i ■ ■ ■ that occurs only once in the word WiW 2 ■ ■ ■ Wi. Roughly speaking, 
at each step one searches for the shortest factor that did not appear before. 

Remark 2. It is easy to see, using for example (^, and the definition of singular words, that 
for every n > 4, /„ = fn- 2 fn- 3 fn- 2 - Therefore, from we have 

f = OlOOUfJn-lfn 


( 10 ) 


^>2 


= 0100 • (1 • 0 • 1)(00 • 1 • 00)(101 • 00 • 101)(00100 • 101 • 00100 ) 


Since 0 = fi, we hence obtain 

(11) / = 010 n /n-l/n/n-l 


%>2 


= 010 • (0 • 1 • 0)(1 • 00 • 1)(00 • 101 • 00)(101 • 00100 • 101 ) 


The factorization (11) is a sort of dual with Lucas numbers of the factorization in singular 
words Indeed, the sequence of factor lengths in & is the sequence of Fibonacci numbers, 

while if in (11) one decomposes the first term as 01 • 0, then the sequence of factor lengths is the 
sequence of Lucas numbers (sequence A000032 in the On-Line Encyclopedia of Integer Sequences): 
2,1,3,4,7,11, etc. 


4 . Christoffel words 

The lower Christoffel words are defined by c„ = 0p„l, for every n > 3. The lower Christoffel 
words are the Lyndon factors of /, i.e., they are lexicographically smaller than any of their proper 
suffixes (with respect to the order induced by 0 < 1). 

If in the Euclidean plane one interprets each 0 by a horizontal unitary step and each 1 with a 
vertical unitary step, the lower Christoffel word c„ is the best grid approximation from below of 
the segment joining the point (0,0) to the point (F„_i,F„_ 2 ) (see Figure[^. 

Analogously, the upper Christoffel words are defined by c), = IpnO, for every n > 3. Therefore, 
the upper Christoffel words are the reversals of the lower Christoffel words (we use the notation 
w for the reversal, a.k.a. mirror image, of the word w). The upper Christoffel words are the anti- 
Lyndon factors of /, i.e., they are lexicographically greater than any of their proper suffixes (with 
respect to the order induced by 0 < 1). Moreover, Cn is the best grid approximation from above 
of the segment joining the point (0,0) to the point (Fji_i, F„_ 2 ). 

Remark 3. For every n > 3, Cn = fn if n is even, c„ = ff) if n is odd. Therefore, cf = fn if‘' t- 
is even, cf = fn if i^ 
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Figure 1. The lower Christoffel word C7 = 0010010100101 (left) and the upper 
Christoffel word 07 = 1010010100100 (right) are the best grid approximations, 
respectively from above and from below, of the Euclidean segment joining the 
points (0,0) and (8,5) = {Fq,F^). 


Lemma 3. For every n > 2 one has 


C2n+1 — C2nC2n— 


C2n+2 — C 2 nC 2 n+l 5 


and therefore 


C2n+1 — C2n—1^2715 


C2n+2 — C2n+lC2n- 


Proof. The first part follows from Lemma by applying the right rotation to each side of the 
equalities. The second part follows from the first by applying the reversal. □ 


The following result states that every Christoffel word is the product of two singular words. 
Lemma 4. For every n > 1 one has 

• C2n+1 = /2n-l/2n 

• C2n+2 = /2n+l/2n 

and therefore 

• C 2 ri +1 = f2nf2n-l 

• C2n+2 = /2n/2n+l 

Proof. Follows directly from Lemma and the definitions of Christoffel and singular words. □ 

Melangon [9] proved that the Fibonacci word is the concatenation of the odd lower Christoffel 
words: 


Proposition 5. The Fibonacci word is the concatenation of the odd lower Christoffel words: 

( 12 ) / = l[c2n+l 

n>l 

= 01-00101 • 0010010100101-•• 

Proof. Follows directly from ([^ and Lemma 


□ 


Actually, Melangon proved that (12 1 is precisely the Lyndon factorization of /. Recall that 
the Lyndon factorization of a word w is w = £ 1^2 ■ ■ ■, where each ii is a Lyndon word and is 
lexicographically greater than or equal to £i+i. The uniqueness of such a factorization for finite 
words is a well-known theorem of Chen, Fox and Lyndon [3]. Siromoney et al. extended this 
factorization to infinite words m- 

Symmetrically, we have the following: 
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Proposition 6 . The Fibonacci word is the concatenation o/O and the even upper Christojfel words: 

(13) / = o[]ci; 


n>2 


= 0 • 100 • 10100100 • 101001010010010100100 • 
Proof. Follows directly from ([^ and Lemma 


□ 


In fact, it is easy to see that (13) is the Lyndon factorization of / if one takes the order induced 
by 1 < 0 . 

We now present two other factorizations based on Christoffel words. To the best of our knowl¬ 
edge, these factorizations did not appear before in literature. 


Proposition 7. The Fibonacci word is the concatenation of 010 and the lower Christoffel words 
where each odd lower Christoffel word is squared: 

(14) / = 010 n ^n+l^‘in+2 

n>l 

= 010 • (01 • 01 • 001)(00101 • 00101 • 00100101 ) • • • 

Proof. Follows directly from ([^ and LemmaIndeed, by Lemmawe have 

C2n+lC2r!,+ lC2n+2 = f2n-lf2n ‘ f2n-lf2n ‘ f2n+lf2n = {f2n-lf2nf2n-l){f2nf2n+lf2n)- 

□ 


Analogously, we have the following: 

Proposition 8. The Fibonacci word is the concatenation of 0100 and the upper Christoffel words 
where each even upper Christoffel word is squared: 

(15) / = 0100 n C2n+lC2n+2 

n>l 

= 0100 • (10 • 100 • 100)(10100 • 10100100 • 10100100 ) • • • 

Proof. Follows directly from ([^ and LemmaIndeed, by Lemmawe have 

^271+1^271+2^271+2 y*27i/*27i—1 ‘ f2nf2n+l ' f2nf2n+l (J^27i/*27i—1/*27i) (/*2n+l/*2n./27i+l) ■ 

□ 


5. Reversals of Fibonacci words 


One of the most known factorizations of the Fibonacci infinite word, and perhaps the most 
surprising, is the following. 

Proposition 9. The Fibonacci word can he obtained also by concatenating the reversals of the 
Fibonacci words: 


(16) 


f = l[fn 


71>2 

= 0 - 10 - 010 - 10010-01010010 • 


Proof. It follows from the definitions that taking the right rot ation of /„ and complementing the 
last letter produces the n-th singular word /„. Therefore, (16 1 follows directly from ^ observing 
that the reversals of the Fibonacci words start with 0 and 1 alternatingly. □ 


The factorization (16) is basically the Crochemore factorization of /—the only difference is that 


the Crochemore factorization starts with 0, 1, 0 and then coincides with the one above (see 0 )- 
Recall that the Crochemore factorization of w is w = C 1 C 2 - - - where Ci is the first letter of w 
and for every i > 1, Ci is either a fresh letter or the longest prefix of CiCi+i ■ ■ ■ occurring twice in 
/1/2 ■ ■ ■ fi- For example, the Crochemore factorization of the word w = 0101001 is 0 - 1 - 010 - 01, 
since 010 occurs twice in 01010. 

In 1995, de Luca considered the following factorization: 
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Proposition 10. The Fibonacci word can be obtained by concatenating the reversals of the even 
Fibonacci words. 


(17) 


/ = n/2. 


n>2 

= 010 ■ 01010010 ■ 010100101001001010010 ■ 


Proof. Applying the reversal toj2|^e have that /„ = fn-ifn-i^ for every n > 2. So (17) follows 
directly from (jl^ by replacing f 2 n- 2 f 2 n-i with / 2 „. □ 

In [5] de Luca proved that the factorization ( |17[ ) has the following minimal property with respect 
to the lexicographical order: any non-trivial permutation of a finite number of the factors will 
produce an infinite word that is lexicographically greater than /. 

Concatenating the reversals of the odd Fibonacci words instead of even ones still produces the 
Fibonacci word, if one prepends a 0: 

Proposition 11. The Fibonacci word can be obtained by concatenating 0 and the reversals of the 
odd Fibonacci words: 


(18) 


/ — 0 11 f2n+l 


n>2 

= 0 • 10010 • 1001001010010 - 


Proof. Follows directly from (16) by replacing f 2 n-if 2 n with f 2 n+i- 


□ 


Recently [7], studying the so-called open and closed words, the following factorization has been 
proved: 

Proposition 12. The Fibonacci word can be obtained by concatenating 01 and the squares of the 
reversals of the Fibonacci words: 

(19) ^ - n-,TT /~^2 


/ = 0lll(/n)' 


n>2 


= 01 • (0 • 0)(10 • 10)(010 • 010)(10010 • 10010 ) ■ 


Proof. Recalling that for every n > 3, one has /„ = /n- 2 /n-i, we have, from (16), that / = 

hhh • • • = 0 • hhhh • • • = 01 • ihfyiMs) • • • = 01 Un>2iZ)^- n 

6. Generalization to standard Sturmian words 


The Fibonacci word is the most prominent example of a standard Sturmian word. Let a be 
an irrational number such that 0 < a < 1 , and let [ 0 ; di + 1 , ^ 2 ) da, • • •] be the continued fraction 
expansion of a. The sequence of words defined by Si = 1, S2 = 0 and = s’^fFi Sn-2 for n> 3, 
converges to the infinite word Wa, called the standard Sturmian word of slope a. The sequence 
of words Sn is called the standard sequence of Wa- The Fibonacci word is the standard Sturmian 
word of slope a = (3 — V5)/2 and its standard sequence is the sequence of Fibonacci finite words, 
since one has di = 1 for every z > 1. 

Most of the factorizations we described in this note can be generalized to any standard Sturmian 
word. However, the proofs become more technical and less easy to present. 
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n 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 

1 1 2 3 5 8 13 21 34 55 89 144 233 377 610 987 1597 2584 4181 6765 

Table 1. The first few values of the sequence of Fibonacci numbers. 


Zeck. 

decimal 

Zeck. 

decimal 

Zeck. 

decimal 

000000 

0 

010000 

8 

100100 

16 

000001 

1 

010001 

9 

100101 

17 

000010 

2 

010010 

10 

101000 

18 

000100 

3 

010100 

11 

101001 

19 

000101 

4 

010101 

12 

101010 

20 

001000 

5 

100000 

13 



001001 

6 

100001 

14 



001010 

7 

100010 

15 




Table 2. The Zeckendorf representations of the first few natural numbers coded 
with 6 bits. 


/i = l 
/2 =0 
/a = 01 

U = 010 

/s = 01001 

fe = 01001010 

/t = 0100101001001 

fs = 010010100100101001010 

/g = 0100101001001010010100100101001001 

Table 3. The first few Fibonacci words. 


P3=£ 

P4 = 0 

P5 = 010 

pe = 010010 

P7 = 01001010010 

PS = 0100101001001010010 

pg = 01001010010010100101001001010010 

Table 4. The first few central words. 
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/^lo 
/4 = 001 
/s = 01010 

/g = 01001001 

/' = 0100101001010 

/s = 010010100100101001010 

Table 5. The first few co-Fibonacci words. 

/i = 0 
/2 = 1 
/3 = 00 
h = 101 
h = 00100 

fe = 10100101 

Table 6 . The first few singular words. 

C3 = 01 
C4 = 001 
C5 = 00101 
C6 = 00100101 
C7 = 0010010100101 
C8 = 001001010010010100101 
Table 7. The first few lower Christoffel words. 


C~3 = 10 
C4 = 100 
85 = 10100 

£e = 10100100 

cr = 1010010100100 

6s = 101001010010010100100 


Table 8. The first few upper Christoffel words. 



